TALP at GeoCLEF-2006: Experiments Using JIRS and Lucene with the ADL Feature Type Thesaurus
Abstract
This paper describes our experiments in Geographical Information Retrieval (GIR) in the context of our participation in the GeoCLEF 2006 Monolingual English task. The TALPGeoIR system follows an architecture similar to that of the GeoTALP-IR system presented at GeoCLEF 2005 [2], with some changes in the Retrieval modes and the Geographical Knowledge Base. The system has four phases performed sequentially: i) a Keyword Selection algorithm based on a Linguistic and Geographical Analysis of the topics, ii) a Geographical Document Retrieval with Lucene, iii) a Document Retrieval task with the JIRS Passage Retrieval (PR) software, and iv) a Document Ranking phase. A Geographical Thesaurus (GT) has been built using a set of publicly available Geographical Gazetteers and the Alexandria Digital Library (ADL) Feature Type Thesaurus. In our experiments we have used JIRS, a state-of-the-art PR system for Question Answering (QA), for the GIR task. We have also experimented with an approach using both JIRS and Lucene. In this approach JIRS was used only for Textual Document Retrieval and Lucene was used to detect the geographically relevant documents. These experiments show that using JIRS alone yields better results than combining JIRS and Lucene.
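The combined mode described in the abstract, where one engine ranks documents by textual relevance and another identifies the geographically relevant ones, might be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the two result sets are merged, so the simple filter used here, the function name `combine_rankings`, and the sample document identifiers and scores are all hypothetical and do not reproduce the authors' implementation or the JIRS or Lucene APIs.

```python
from typing import Iterable, List, Tuple


def combine_rankings(
    textual_ranking: Iterable[Tuple[str, float]],
    geo_relevant_ids: Iterable[str],
) -> List[Tuple[str, float]]:
    """Keep textually retrieved documents that are also geographically relevant.

    Assumption: the two result sets are merged by filtering, which the
    abstract does not confirm. `textual_ranking` stands in for the output
    of a textual retrieval engine as (doc_id, score) pairs;
    `geo_relevant_ids` stands in for the documents flagged by a
    geographical retrieval step.
    """
    geo = set(geo_relevant_ids)
    # Order by descending textual score, then drop non-geographic documents.
    ordered = sorted(textual_ranking, key=lambda pair: pair[1], reverse=True)
    return [(doc_id, score) for doc_id, score in ordered if doc_id in geo]


# Hypothetical inputs, for illustration only.
textual = [("d2", 0.7), ("d1", 0.9), ("d3", 0.4)]
geographic = ["d3", "d1"]
result = combine_rankings(textual, geographic)
```

A filter of this kind can only remove documents from the textual ranking, never add or reorder them, which is one way a combined run could lose documents that the textual engine had ranked correctly.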
Similar resources
The GeoTALP-IR System at GeoCLEF-2005: Experiments Using a QA-based IR System, Linguistic Analysis, and a Geographical Thesaurus
This paper describes the GeoTALP-IR system, a Geographical Information Retrieval (GIR) system. The system is described and evaluated in the context of our participation in the CLEF 2005 GeoCLEF Monolingual English task. The GIR system is based on Lucene and uses a modified version of the Passage Retrieval module of the TALP Question Answering (QA) system presented at CLEF 2004 and TREC 2004 QA eval...
TALP at GeoQuery 2007: Linguistic and Geographical Analysis for Query Parsing
This paper describes our experiments on the Geographical Query Parsing pilot-task for English at GeoCLEF 2007. Our system uses some modules of a Geographical Information Retrieval system presented at GeoCLEF 2006 [3] and modified for GeoCLEF 2007. The system uses deep linguistic analysis and Geographical Knowledge to perform the task.
N-Gram vs. Keyword-Based Passage Retrieval for Question Answering
In this paper we describe the participation of the Universidad Politécnica of Valencia in the 2006 edition, which was focused on the comparison between a Passage Retrieval engine (JIRS) specifically aimed at the Question Answering task and a standard, general-purpose search engine such as Lucene. JIRS is based on n-grams, Lucene on keywords. We participated in three monolingual tasks: Spanish, Ital...
TALP at GeoCLEF 2007: Using Terrier with Geographical Knowledge Filtering
This paper describes our experiments in Geographical Information Retrieval (GIR) in the context of our participation in the GeoCLEF 2007 Monolingual English task. Our system, called TALPGeoIR, follows an architecture similar to that of our previous system presented at GeoCLEF 2006 [2], with some changes in the Retrieval modes and the Geographical Knowledge Base. The system has four phases performed seque...
The UPV at QA@CLEF 2006
This report describes the work done by the RFIA group at the Departamento de Sistemas Informáticos y Computación of the Universidad Politécnica of Valencia for the 2006 edition of the CLEF Question Answering task. We participated in three monolingual tasks: Spanish, Italian and French. The system used is a slightly revised version of the one we developed for the past year. The most interesting ...